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Weak measurement has provided new insight into the nature of quantum measurement, by demon¬ 
strating the ability to extract average state information without fully projecting the system. For 
single qubit measurements, this partial projection has been demonstrated with violations of the 
Leggett-Garg inequality. Here we investigate the effects of weak measurement on a maximally 
entangled Bell state through application of the Hybrid Bell-Leggett-Garg inequality (BLGI) on a 
linear chain of four transmon qubits. By correlating the results of weak ancilla measurements with 
subsequent projective readout, we achieve a violation of the BLGI with 27 standard deviations of 
certainty. 


Quantum computing promises greater processing 
power through the clever application of superposition 
and entanglement. Despite the importance of this 
uniquely quantum behavior, it occurs elusively behind 
the non-unitary effects of measurement collapse. Weak 
measurements[1-3] have provided new insight into this 
process by demonstrating the ability to extract aver¬ 
age state information without fully collapsing the sys¬ 
tem. These gentler measurements have allowed single¬ 
configuration violations of the Leggett-Garg inequality 
[4-11] and, more recently, the detailed tracking of single 
qubit trajectories [12, 13]. It is an outstanding challenge 
however, to achieve the same level of measurement con¬ 
trol with an entangled state. Here we demonstrate a con¬ 
tinuous and controlled exchange between extracted single 
qubit state information and two-qubit entanglement col¬ 
lapse, through the unique framework of the Bell-Leggett- 
Garg inequality (BLGI). We quantify this effect by corre¬ 
lating variable strength ancilla qubit measurements with 
subsequent projective readout to collect all the statistics 
of a Bell inequality experiment [14-17] in a single quan¬ 
tum circuit. Additionally, we demonstrate the ability to 
measure the Bell state with minimal entanglement col¬ 
lapse, by violating this hybrid BLGI[18] at the weakest 
measurement strengths. This experiment indicates that 
it is possible carry out high fidelity ancilla measurement 
in large entangled systems. Additionally, combining this 
experiment with remote entanglement methods[19] may 
eventually lead to a loophole-free violation of classical 
hidden variable theories. 

The challenge of successfully implementing weak mea¬ 
surements is twofold: the first is to evaluate the amount 
of information extracted on average by the measurement; 
the second is to evaluate the measurement back-action 
on the system. For a single qubit state, the Leggett- 


Garg inequality [20] (LGI) provides an elegant way to 
do both with a single experiment. The LGI was orig¬ 
inally designed to verify the “quantumness” of macro¬ 
scopic objects through the effects of projective measure¬ 
ment, which allows larger correlations between successive 
measurements (e.g., at times t\ <t 2 < £3) than are pos¬ 
sible classically. More recent generalizations of the LGI 
prepare a known state at time t\ and replace the interme¬ 
diate measurement at time t 2 with a weak measurement 
[4-11]. This minimizes the quantum state disturbance 
while still extracting sufficient information on average. 
Ideally this allows all the statistics necessary for a vi¬ 
olation of the inequality to be measured with a single 
experimental configuration. Violating the inequality in 
this way guarantees that the state information has been 
extracted without significant back-action on the system 

[7]. 

Evaluating the effect of weak measurements on an en¬ 
tangled state is more difficult because the degree of entan¬ 
glement is in generally challenging to quantify. The most 
robust method for quantifying entanglement remains a 
Bell test [14], which was first proposed by Bell and later 
refined by Clauser, Horne, Shimony, and Holt into an in¬ 
equality (CHSH). The CHSH term sums the correlation 
measurements of two spatially separated qubits in four 
different measurement bases and bounds the maximum 
total value of classical correlations to be |CHSH| c i ass < 2. 

In superconducting qubits, we use qubit state rotations 
to map the desired measurement basis onto the ground 
(|0» and excited (|1)) states of the system. For measure¬ 
ment rotations a (qubit 1) and b (qubit 2), the correlation 
amplitude between two measurements is given by 

E(a, b) = P{ 00) - P(10) - P(01) + P(ll), (1) 
where the term P(00) is the probability both qubits are 
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Figure 1. Schematic of the hybrid Bell-Leggett-Garg inequal¬ 
ity and optical micrograph of the superconducting quantum 
device. The algorithm consists of two LGI weak measure¬ 
ment branches, bridged by the entanglement of the central 
Bell qubits. The Bell pair (Pi, 2 ) is initially prepared in the 
anti-symmetric singlet Bell state Next, each Bell qubit 

is rotated to its first measurement basis (a or b) and entan¬ 
gled with its ancilla qubit ( 01 , 2 ). Finally, the Bell qubits are 
rotated and projectively read out in bases corresponding to 
angles a' and b'. By correlating the final projective read out 
and the weak ancilla measurements we calculate all four terms 
of a CHSH correlator simultaneously. 

in the ground state. A traditional CHSH experiment 
combines the four correlator terms 

CHSH = E(a, b) + E(a\ b) + E(a, b') - E(a\ b'). (2) 

Entangled quantum states can violate the classical 
bound, with a fully entangled Bell state ideally saturat¬ 
ing the quantum upper bound of |CHSH| quant < 2\/2 at 
the specific rotation angles a = 0, b = 7r/4, a' = tt/2, and 
b' = 37t/4. 

To understand the effect of weak measurement on an 
entangled state we combine the spatial correlations of a 
Bell inequality with the temporal correlations of an LGI 
to construct a Bell-Leggett-Garg inequality (BLGI). The 
algorithm, as described by Dressel and Korotkov [18] and 
shown in Fig. 1, consists of a CHSH-style experiment in 
which each Bell qubit is measured twice in succession 
as for a simultaneous LGI [7, 10, 21]. The initial mea¬ 


surements are carried out by ancilla qubits which act as 
probes of the entangled system before being projectively 
read out. By varying the degree of entanglement between 
each ancilla and the Bell qubit it probes, we can vary the 
strength of the measurement. After preparing the Bell 
qubits in the anti-symmetric singlet state 

|*-> = -^(|01>-|10>), (3) 

each Bell qubit (Pi, 2 ) is rotated to its first measurement 
angle (a = 0, b = n/4) and then entangled with its ancilla 
qubit ( 0 : 1 , 2 ) to implement the tunable-strength measure¬ 
ment. Next, each Bell qubit is rotated to its final mea¬ 
surement angle (a' = tt/ 2, b' = 37r/4), and all four qubits 
are read out. With this procedure the data for each mea¬ 
surement angle is encoded on a distinct qubit (a —> aq, 
b —> OL 2 -, a 1 Pi, and b 1 —* P 2 ). The BLGI correlator 
then takes the form similar to Eq. (2) 

(C) = —E(a\,a2) — E(ai, P 2 ) + E(Pi,ai2) — E(Pi, P 2 ). 

(4) 

where each term is calculated as in Eq. (1). 

The BLGI bounds correlations between these four dis¬ 
tinct measurements based on the classical assumptions 
of local-macro-realism [18, 22]. Classically the measure¬ 
ment of one qubit in the Bell pair should have no affect 
on the other, and the strength of the ancilla measure¬ 
ment should have no effect on the result of the follow¬ 
ing projective readout. By encoding the measurement 
result for each rotation angle on an independent qubit, 
we can test both of these assumptions at the same time. 
Quantum mechanically if the ancilla measurement is fully 
projective then we should measure the expected correla¬ 
tion amplitude of \E(oti, 0 * 2 )| = — l/v^2 only in the initial 
measurement basis, as the Bell qubits are no long entan¬ 
gled after that measurement. As the ancilla measure¬ 
ment strength is decreased, we should extract the same 
qubit information on average while only partially collaps¬ 
ing the Bell state. For sufficiently weak measurements, 
the magnitudes of all four correlators should approach 
the unperturbed Bell state values of 1/ \/2 while (C) ap¬ 
proaches 2i/2. Thus a violation implies that our system 
demonstrates non-classical correlations through the en¬ 
tanglement of the Bell pair, while also demonstrating the 
ability to extract average state information without sig¬ 
nificant back-action on the entanglement of the system. 

We performed this experiment on a linear chain of 
Xmon transmon qubits, shown at the top of Fig. 1, with 
ground to excited state transition frequencies in the 4- 
6 GHz range [23]. Each qubit is individually addressed 
with a microwave control line which can be used for single 
qubit X or Y gates as well as a DC line for implementing 
Z-gates and frequency control. These control lines are 
used in conjunction to execute high fidelity two-qubit 
gates [24] for entanglement and ancilla measurement. 
The state of each qubit is measured independently us¬ 
ing the dispersive shift of a dedicated readout resonator. 
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Figure 2. Weak measurement protocol, a, Full pulse sequence 
of the ancilla measurement algorithm used in the BLGI ex¬ 
periment. The measurement consists of a variable amplitude 
Y rotation by an angle 4> which controls the strength of the 
measurement. This is followed by a CZ gate that entangles 
the ancilla qubit with the target qubit. Finally the ancilla is 
rotated by an angle — 7 t/ 2, bringing it into the desired mea¬ 
surement basis. Two cases are compared, that of the tar¬ 
get qubit in the ground (blue) or excited (red, 7r rotation) 
state, b, Bloch sphere representation of the ancilla qubit 
during the weak measurement protocol when the target qubit 
is in either the ground (blue) or excited (red) state. The Z 
averages of the ancilla and target qubit are correlated such 
that (Z) a = sin(< fi)(Z) T , where a full projective measurement 
corresponds to (j> = 7r/2 and no measurement corresponds to 
0 = 0. c, Ancilla measurement of prepared target state before 
and after calibrating for measurement strength. We calibrate 
both curves by the scaling factor required to normalize the 
average 0 state curve. This is almost equivalent to dividing 
by sin(</>) but bounds the calibrated mean by ±1. In the cali¬ 
brated case, the measured mean remains unchanged while the 
measured variance increases as </> decreases. The gold shaded 
region denotes angles at which weak measurement data can 
violate the BLGI while still being reliably calibrated 


Resonators are frequency multiplexed [25] and readout 
with a broadband parametric amplifier [26], which al¬ 
lows for fast high fidelity measurement. Further details 
of this device can be found in Refs. 22 and 23. 

The ancilla measurement protocol used in this exper¬ 
iment and shown in Fig. 2, is a modified version of the 


protocol demonstrated in an LGI violation from Groen 
et. al. [10]. Initially, an ancilla qubit is U-rotated by 
an angle 0 < <p < 7r/2 from its ground state to set the 
measurement strength. A control phase gate is then per¬ 
formed, causing a Z rotation of n/2 in the ancilla qubit 
depending on the target qubit’s state. Finally a —Y ro¬ 
tation of 7t/2 is performed on the ancilla qubit to rotate 
into the correct measurement basis. The visibility of this 
measurement is then proportional to the distance of the 
ancilla state vector from the equator of the Bloch sphere, 
as shown in Fig. 2 (b). When <p = n/2 this operation 
becomes a control-NOT gate and implements a projec¬ 
tive measurement. As <fi —0 the ancilla states become 
degenerate and no information is extracted. 

As the final position of the ancilla state is dependent 
on the measurement strength, the ancilla readout is im¬ 
perfectly correlated with the target qubit. That is, the 
visibility of an ancilla Z average, (Z) a ~ sin (cf>)(Z) T , is 
compressed from the target Z average by a factor of ap¬ 
proximately sin(</>). To reconstruct the target Z average 
from the ancilla Z average, we should thus rescale the 
signal by l/sin(^>). Initially, this was done by a linear 
fit of the rotation angle (f> to the qubit drive amplitude. 
Unfortunately this linear fit was too rough to properly 
calibrate the smallest drive amplitudes and led to a sys¬ 
temic offset in the calibration of (Z). To keep the model 
as simple as possible, we kept the linear fit but used a 
data-based rescaling to set the measured ground state 
(|0» average to 1, which ensures that the calibrated an¬ 
cilla average is properly bounded by ±1, as shown in 
Fig. 2(c). Further details of this calibration can be found 
in the supplemental information [22]. 

After this and other extensive system calibrations, we 
can begin our investigation in earnest. To quantify the 
entanglement collapse, we measure each two-qubit cor¬ 
relator in ( C) vs. ancilla measurement strength </>. The 
data are plotted in Fig. 3 alongside theory curves gener¬ 
ated by a quantum model that includes realistic environ¬ 
mental dephasing and readout fidelity [18]. Error bars for 
the data represent ±10 standard deviations of the mean 
to demonstrate the increase in noise with decreasing mea¬ 
surement strength. For projective angles </> ss n/2, the 
ancilla measurement results (E(a i, 0 : 2 )) reflect the corre¬ 
lation expected from a fully collapsed Bell pair. As mea¬ 
surement strength is decreased, this ancilla correlation 
remains nearly constant while additional inter-qubit cor¬ 
relations {E(ai, (3 2 ), E(P 1 ,a 2 ), E(/3 1 ,P 2 )) emerge. For 
sufficiently weak measurements, ( C) exceeds the classi¬ 
cal bound of 2 and saturates towards the CHSH value of 
2.5, which is expected from simulations for a fully entan¬ 
gled Bell state in realistic experimental conditions [22]. 

The measured BLGI correlations follow the theoreti¬ 
cal model [22] very closely for all measurement strengths. 
This behavior reveals the continuous and controlled ex¬ 
change between the collapse of an entangled Bell state 
and the single qubit state information gained from 
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(c)= E(a 2 ,p 1 )-E(a 1 ,a 2 )-E(a 1 ,3 2 )-E(Pi,(3 2 ) 
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Figure 3. Graph showing both experimental data (points) 
and theoretical predictions (lines) for the correlator (C) and 
its four terms vs. measurement strength <j>. The horizontal 
gold line denotes the classical bound on ( C). The data set was 
taken by averaging together 200 traces in which each point 
was measured 3000 times for a total of 600,000 iterations per 
point. The error bars represent 10 standard deviations of 
the mean to demonstrate the scaling the ancilla measurement 
noise vs. <j>. The magnitude of the correlations between each 
pair of qubits reveals the extent to which entanglement has 
been broken for each measurement strength. 

tunable-strength measurements. Each ancilla qubit, 
when calibrated, retains the same correlations for all 
measurement strengths, whereas each Bell qubit has its 
correlations damped through partial projection by its 
ancilla qubit [18]. The effect of partial projection can 
be seen in the difference in functional behavior between 
the Bell-ancilla (E(ai, fa), E(/3i, a^)) and the Bell-Bell 
(E(/3i, fa)) correlator terms. In the Bell-ancilla terms, 
the correlations are suppressed solely due to the random¬ 
ization of the Bell qubit, but return as soon as measure¬ 
ment strength is decreased. In the Bell-Bell term, this 
effect is compounded as both qubits are being damped 
by partial ancilla projection, so the correlations return 
more slowly. This gives JT(/3i, /? 2 ) its distinct shape com¬ 
pared to the other correlators. The BLGI terms can be 
seen in greater detail in the supplement [22]. 

While other methods exist to characterize entangle¬ 
ment, the correlation measurements of a CHSH exper¬ 
iment remain one of the more robust tests of quantum 
behavior due to the considerations given to experimen¬ 
tal loopholes. Fortunately, the unique construction of 


the BLGI allows us to avoid some of the more pervasive 
loopholes appearing in traditional Bell or LG inequalities. 
The simultaneous measurement of all four CHSH terms 
in a single circuit allows us to avoid any configuration- 
dependent bias, such as the disjoint sampling loophole 
[27]. The near unit detection efficiency in superconduct¬ 
ing systems [22, 28] similarly bypasses the fair sampling 
loophole [29], which has hindered the investigation of re¬ 
lated hybrid inequalities in optical systems [9, 30]. Ad¬ 
ditionally, since the data from each ancilla qubit is only 
correlated with the data from the Bell and ancilla qubits 
on the remote LGI branch, we can substantially relax 
the usual LGI noninvasive measurement assumption to 
the standard locality assumption needed for a Bell in¬ 
equality instead. 

This locality assumption, fundamental to any Bell in¬ 
equality, presumes no classical interactions between re¬ 
mote qubits occur during the correlation measurements. 
The close proximity of adjacent superconducting qubits 
on a chip implies that such an interaction cannot be ruled 
out here. Thus, behavior that appears to be quantum 
could, at least in principle, be the result of a fast clas¬ 
sical interaction between hidden variables in the system. 
While we cannot yet completely rule out these local inter¬ 
actions, there are several promising approaches to closing 
this loophole as well. The assumptions of the BLGI re¬ 
quires only spatial separation of the central Bell qubits. 
The speed and fidelity of operations in a superconducting 
qubit system makes modest spatial separation sufficient, 
and we can sacrifice some Bell state preparation fidelity 
to achieve it. Techniques such as remote entanglement 
through measurement [19], may soon provide the spatial 
separation necessary to conduct a loophole-free BLGI ex¬ 
periment. 

Despite the few remaining loopholes, the excellent 
agreement between data and theoretical predictions in 
this experiment allows us to draw certain likely conclu¬ 
sions about the application of ancilla measurement in 
superconducting circuit systems. The functional depen¬ 
dence of the BLGI correlator on measurement strength 
implies that the back-action is dominated by the pro¬ 
jectiveness of the measurement. This makes it unlikely 
there is some poorly understood classical error mecha¬ 
nism which would make it difficult to implement error 
correction schemes such as the surface code [31], which 
rely on sequential high fidelity ancilla measurements to 
detect errors [23]. The violation of the BLGI at the weak¬ 
est measurement strength also implies that it is possible 
to extract average state information without significant 
back-action. Normally the usefulness of this type of mea¬ 
surement is limited by the large number of statistics re¬ 
quired to average the noisy detector output and the co¬ 
herence time of the qubits limiting the total number of 
measurements. It should be possible however, to inte¬ 
grate weak ancilla measurements into a large surface code 
cell, which would correct for most errors while also allow- 
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ing for a larger ensemble to be collected from the weak 
measurement. Thus weak ancilla measurement could be¬ 
come an important tool in understanding the dynamics 
of large quantum systems. 

In the course of this work we have demonstrated the 
continuous and controlled collapse of an entangled state 
based on the strength of tunable ancilla measurements. 
This behavior was quantified using the simultaneous cor¬ 
relation measurements which make up the Bell-Leggett- 
Garg inequality. The violation of this inequality at the 
weakest measurement strengths demonstrates the viabil¬ 
ity of using weak ancilla measurements to conduct many 
sequential measurements of entangled states. This pro¬ 
vides a window into the evolution of entangled states, 
a critical component in scaling to larger quantum sys¬ 
tems. With the inclusion of new remote entanglement 
algorithms the BLGI may also lead to a loophole-free vi¬ 
olations of classical hidden variable theories. Lastly this 
demonstrates that as we scale to larger multi-qubit sys¬ 
tems, with the fidelity and control achieved here, we gain 
greater access to the rich physics at the heart of quantum 
mechanics. 
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SUPPLEMENTAL INFORMATION 


Bell and Leggett-Garg Inequalities 

The CHSH correlator, designed by Clauser, Home, Shimony, and Holt[l] as a refinement of 
the Bell inequality[2], provides a quantitative bound on classical hidden variable theories using 
correlated measurements between two spatially separated qubits. The correlator combines four 
different experimental configurations because it can be difficult to tell the difference between po¬ 
tentially classical (un-entangled) qubits and an entangled state in only one basis. With supercon¬ 
ducting qubits, the measurement basis for each qubit is set using qubit rotations to map the desired 
state onto the ground (|0)) and excited (|1)) states of the system. For measurement rotations a 
(qubit 1) and b (qubit 2), shown in Fig. 1(a), the correlation amplitude is given by 

E(a, b ) = P(00) - P(10) - P(01) + P(ll), (1) 

where P(00) is the probability both qubits are in the ground state. Given this equation we can see 
that both the Bell state |<3> + ) = (100) + | ll))/\/2 and the prepared state |00) will have a correlation 
amplitude of 1 if a = b = 0. The difference only becomes clear when the detector angle of one 
qubit is rotated relative to the other. The behavior of E(a, b) vs detector rotation, described here 
as 6 = a — b, is shown in Fig. 1(b) for both the classical and quantum case. If the two objects can 
be described separately, then E is only a linearly dependent on 6. If the two objects are entangled, 
then E is has a sinusoidal dependence on 6 with the maximum difference occurring at 9 = n/A. 

To initially characterize the system we conducted a traditional CHSH experiment using the 
central Bell qubits (/3 i j2 ). The relative measurement angles for each qubit were held fixed such 
that a' = a + 7t/ 2 and b' = b + 7t/ 2. We then varied 9 = a — b from 0 to 7r, and measured each 
individual correlator as well as the sum given by 

CHSH = E(a, b) + E{a!, b) + E(a, b') - E(a', b'). (2) 

For any two classical states measured at these angles, we should see a linear dependence of E{9) 
and a bound on the the CHSH correlator of \CHSH\ < 2. Alternately, if the two qubits are in 
a maximally entangled Bell state, we should see sinusoidal behavior for E{9) and a maximum 
CHSH value of 2y/2. The data, shown in Fig. 1(c), display the expected sinusoidal dependence 
for each individual term, with a maximum CHSH amplitude near 9 = tt/A. While this data shows 
a robust violation of the classical bound, it fails to reach the theoretical maxium bound of 2\/2. 
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Figure 1. Entanglement and the CHSH correlator, (a) Schematic of a typical CHSH experiment. Two qubits 
are prepared in one of the four possible Bell states and measurements are conducted at various measurement 
angles a, b, a' = a + tt/2, and // = b + tt/2. (b) Dependence of correlation amplitude (Eq. 1) on detector 
angle difference 6 = a — b. The difference between quantum and classical correlation is maximized at 
6 = 7t/4. (c) Measurement of CHSH correlator using the central Bell qubits prepared in a |^> + ) Bell 
state, displaying the sinusoidal dependence on the relative measurement angle 9 . This experiment avoids 
common sampling loopholes with high detector efficiency, but is subject to the locality loophole due to the 
small spatial separation of adjacent superconducting qubits on a chip. 

The maximum CHSH amplitude of ~ 2.5 we see here is due to experimental imperfections which 
will be discussed later. This CHSH experiment provides the framework for the BLGI, as well as a 
benchmark for the maxium violation we should expect at the weakest measurement angles. 

A complementary test of quantum mechanics is the LGI, which is similar to a Bell inequality 
but involves measurements separated in time rather than in space. Classical theories of measure¬ 
ment assume that the system is always in a definite state, and that an ideal measurement will not 
change the state of the system. In contrast, if one were to measure a quantum state in an orthogonal 
measurement basis, the act of measurement would project that object onto an eigenstate of the new 
basis. To distinguish one kind of system from the other, measurements are conducted in different 
bases at different times. For measurements conducted at times t 1 < t 2 < t 3 , we can construct 
correlators analogous to Eq. 1 but for different measurements of the same qubit, 

E(ti, tj) = P(00) - P(10) - P(01) + P(ll). (3) 

The inequality was originally composed of three distinct experiments. In the first experiment, the 
system is measured projectively at time ti, followed by a final projective measurement at time t 3 . 
A second experiment is then carried out where an intermediate measurement in a different basis is 
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conducted at time t 2 instead of time t : >. The third experiment consists of only the measurements at 
times f 2 and t.>. The LGI is then given by 


— 3 < E(ti,t 2 ) + E(t 2 , t 3 ) — E(ti,t 3 ) < 1 (4) 

where E(ti,t 3 ) is the experiment in which no measurement is performed at time t 2 . Further details 
for LGIs can be found in the review article by Emary et al. [3]. 

The weak measurement techniques discussed in the main text were created to avoid the pos¬ 
sibility of a “clumsy” measurement loophole [4]. When sequential measurements are performed 
on the same system, it is impossible to ensure that LGI violations are not due to overly invasive 
measurements perturbing the system in an unknown way. To minimize the effect of measurement, 
most LGIs replace the measurement at time t\ with preparation of a known state, and the mea¬ 
surement at time t 2 with a null [5] or weak [6, 7] measurement. These weak measurements [6, 7] 
minimize back action on the system, while still extracting enough information to identify its state. 
Using this technique, all the statistics of the LGI can be measured by conducting all three “mea¬ 
surements” in a single experimental configuration. To construct the Bell-Leggett-Garg inequality 
we combined a traditional CHSH experiment with this weak measurement technique. This allows 
us to measure all four terms of the CHSH correlator simultaneously in a single experiment. 

BLGI Algorithm Assumptions and Loopholes 

The fundamental assumptions of the hybrid Bell-Leggett-Garg inequality are those of local 
realism, which are familiar from the Bell inequalities: 

(i) If an object has several distinguishable physical states A, then at any given time it occupies 
only one of them. 

(ii) A measurement performed on one object of a spatially-separated pair cannot disturb the 
second object. 

(iii) Measured results are determined causally by prior events. 

Note that only assumption (ii) differs from the notion of macrorealism used in Leggett-Garg 
inequalities: it is weakened here to permit local invasiveness for sequential measurements in time 
made on the same object, while still forbidding spatially remote measurements from influencing 
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each other. Note that the assumed physical state A may be related to the quantum state, or may be 
a collection of more refined (but unspecified) hidden variables. 

To these core assumptions we must append one more to permit noisy (i.e., realistic) detectors: 

(iv) Unbiased noisy detectors produce results that are correlated with the true object state A on 
average. 

This assumption can be understood as follows. The object state A ideally determines each measur¬ 
able property A(A), but a physical detector (and environment) that interacts with the system will 
also have a distinct physical state £ that may fluctuate noisily between realizations (e.g., from the 
coupling procedure). In such a case the detector will report a correspondingly fluctuating signal 
a(£) according to some response probability Pa(£|A) for obtaining the detector state £ given each 
definite system state A. For any sensible detector, these response probabilities will be fixed by 
the systematic and repeatable coupling procedure (such as our ancilla measurement circuit). To 
calibrate such a detector, we must then assume that averaging over many realizations of the detec¬ 
tor noise will faithfully reflect information about each prepared system state A (even if that state 
ultimately changes for subsequent measurements due to the coupling): 

£^(£)P a (£|A) = A(A). (5) 

Importantly, this equality formally states only what is usually assumed for an unbiased laboratory 
detector: that one can recover a meaningful system value A(A) by averaging away any detector 
noise. 

Now consider the Bell-Leggett-Garg correlation. A correlated pair of objects with the joint 
state A is sampled from an ensemble with the distribution P{ A). (In our experiment, we prepare 
two qubits in a Bell state.) At a later time each object (k = 1, 2) is coupled to a detector (an 
ancilla qubit) that outputs a noisy signal a:/,, calibrated to measure the bounded property A k ( A) £ 
[—1,1] on average (the Z operator for each Bell qubit). The noisy signal a k generally has an 
expanded range of values that can lie outside the range [—1,1] (in our case a k ~ ±l/sin0); 
however, for each A the realizations of the output signal will average to the correct bounded value 
by assumption (iv). (We verify this assumption with the ancilla calibration measurements using 
definite preparations of 0 or 1 on the Bell qubits.) Finally, each object is measured with a second 
detector that outputs a signal b k for a similarly bounded property B k {£) £ [—1,1] (we read out 
the qubits directly to obtain b k = ±1). From these four measured numbers, we then compute the 
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CHSH-like correlator as a single number for each preparation 


C — — OL\Ot,2 — C>n&2 + b\Oi2 — &1&2- (6) 

The expanded ranges of the noisy signals a k generally produce a similarly expanded range for 
the correlator C for each preparation. Nevertheless, averaging C over many realizations of the 
detector noise ^A k and £ tBk and system states A will produce 

(C> = E»E«.,4» 1 CP(U,U|A)P(U,fe|A)P(A), 

?a 2 ,?b 2 

= Ea [ - A(A)A 2 (A) - A,(A)B'(A) + B;(A)A 2 (A) - B',(A)B'(A)] B(A), (7) 

with Afc(A) = Et,,^,«*(U)P(U.U|A) and = Y. iAk , iBk M&J P(U,foW. 

since postulate (ii) causes the joint distribution of the detector states to factor: P(g Al ■ • £,a 2 - £b 2 I A) 

P(£ai, €bi |A)P(U,6fc|A) in the same way as for a Bell inequality. From the postulates (i), (iii), 
and (iv), the averages A k (X) and B k ( A) are then bounded to the range [—1,1]. Therefore, for each 
A the sum of the bounded averages in Eq. (7) must itself be bounded by [—2, 2], Averaging this 
bounded result over P(A) produces the expected BLGI 

-2 < (C) < 2. (8) 

Importantly, the joint probability P(^ fc , £bJ A) = P(t,A k \ A)P(^s fc | A, £ Ak ) for each qubit k ad¬ 
mits the dependence of the B' k measurement on an invasive A k measurement that can alter the 
physical state A. Despite any randomization of the results b f JAn k ) caused by such local invasive¬ 
ness, however, the perturbed averages B' k { A) must still lie in the range [—1,1] since each b k = ±1 
by construction. This allowance for locally invasive measurements in the BLGI is what avoids the 
clumsiness loophole [4] of the usual LGI. The fact that the entire correlator C is computed for 
every realization in the same experimental configuration is what avoids any variant of the disjoint 
sampling loophole [8] for the usual Bell and LGIs (such as from systemic bugs in the preparation 
software). 

There are, however, two notable ways that our derivation of the BLGI in Eq. (8) could fail. 
Lirst, the assumptions (i—iii) of local realism could fail, as in a standard Bell inequality. This is 
certainly possible in our case since the Bell qubits are neighbors on the same superconducting chip. 
However, arranging for a locally realist model that accounts for the needed disturbance effects for 
the neighboring Bell qubits, the neighboring Bell-ancilla qubits, each remote pair of Bell-ancilla 
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qubits, and the remote ancilla-ancilla qubits simultaneously is substantially more difficult (and 
therefore much less likely) than arranging for such disturbance in the usual Bell test on just two 
neighboring qubits. Moreover, our experiment verifies the detailed functional dependence of the 
quantum predictions as the weak measurement angle 4> is varied, which further constrains any 
purported locally realist explanation. Thus our tested BLGI significantly tightens the locality 
loophole [9] compared to the usual Bell test performed on the same chip. 

Second, the noisy detector assumption (iv) could fail due to hidden preparation noise £p not 
included in the state A that systematically affects the detector output in both arms in a correlated 
way. In this case, the detector response would become noise-dependent P^(£|A) —> P 4 (£|A, £p) 
such that the calibration of Eq. (5) will be satisfied only after additionally averaging over £p. 
Such correlated noise would prevent the detector distributions from factoring for each A in Eq. (7), 
which formally spoils the inequality. However, in our experiment such a systematic bias due 
to correlated noise has been extensively checked during the characterization of the chip and the 
measurement calibration by deliberately preparing a variety of uncorrelated distributions P(A) 
(i.e., different qubit states) and looking for spurious cross-correlations of the various qubit readout 
signals that would be expected in the presence of such hidden preparation noise. Hence, the 
failure of assumption (iv) additionally requires an unlikely preparation-conspiracy where every 
calibration check that has been done is somehow immune to the hidden detector-noise correlations. 


Weak Measurement Calibration 

As discussed in the main text the ancilla readout is imperfectly correlated to the Bell qubit’s 
state. When measuring in the Z basis, ( Z) a = sin (cj))(Z)p, shown in Fig. 2 (a) along with the 
ideal curves ± siu(o). To calibrate this weak measurement we must first relate the measurement 
angle 0 to microwave drive power, by fitting to a measurement of |1) state probability vs. tt -pulse 
amplitude. The most straight forward calibration would then be to divide ( Z) a by sin(0) shown 
in the blue curves in Fig. 2 (b), but this method causes drift in the mean at the smallest angles. 

This simple calibration fails because the raw data curves shown in Fig. 2 (a) converge to a value 
slightly below zero. This means that for the weakest measurements, the simple angle calibration 
will under-correct a |1) state measurement and over-correct a |0) state measurement. This over¬ 
correction of the |0) state is problematic, since calibrated values for (Z) a not bounded by ±1 will 
possibly violate the inequality incorrectly. To prevent this, we instead use a data based calibration 
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measurement strength <J>/7t 


measurement strength §ln 


Figure 2. (a) Raw measurement of (Z) vs. measurement strength <j> for both a\ and a 2 along with an ideal 
curve of ± sin(^). Due to decoherence, measurement eiTor, and possible calibration error the data does not 
perfectly follow the ± sin(ci) curves and converges to a value slightly below 0 at the weakest measurement 
strength, (b) |0) state calibrations done using both the 1/ sin(0) method (blue) and the point by point 
calibration (green). In the 1/ sin(<j)) calibration the mean of the |0) state measurement drifts above 1 at the 
weakest measurement angles, while the mean follows 1 for the point by point case. 


for each ancilla using the average of the |0) state measurement curve. This has the advantage of 
bounding the mean of the calibrated result by ±1, at the expense of accentuating the drift in the 
mean of the |1) state measurement towards 0. The results of this |0) state calibration are shown in 
the green curves in Fig. 2 (b). 

To apply this calibration to the correlator terms, we must first express them in terms of the 
measurement operator (Z ). In a superconducting system the state rotations are used to map the 
desired measurement basis onto the ground (|0)) and excited (|1)). For the ancilla measurement 
this is equivalent to mapping onto the Z measurement axis. Given probability P(l) of measuring 
the excited (11)) state, (Z) = 1 — 2P(1). After mapping state probabilities onto the Z measurement 
axis we can express the correlator as E(a, /3) = {Z) a (Z)p. Expressed in this way we can see that 
for calibration factor cal{4>) ~ 1/sin(0), E ca i(a, j3) = E(a,/3) * calio). Extending this to the 
BLGI we calibrate each term depending on the ancilla qubit being measured such that E(a±, f3 2 ) —>■ 
E(a±, /3 2 )*cal((j) i), E(/3 1 ,a 2 ) —>• E(/3i, a 2 ) *cal(4> 2 ), E(cti,a 2 ) —>• E(a±, a 2 ) * cal (fa) *cal(cj) 2 ), 
and E((3i, 3 2 ) remains unchanged. 
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Error Analysis and Pulse Sequence Optimization 


While the algorithm and weak measurement scheme are simple in design, dependence on cor¬ 
relations between multiple qubits makes (C) sensitive to multiple error mechanisms. As all four 
qubits were operated away from the flux insensitive point they were more susceptible to dephasing 
effects. The Xmon qubits used in this experiment were extensively characterized in Ref. [10], 
with a T 2 Ramsey decay of order 2 — 3 /is at the idle point. When Characterizing the phase er¬ 
ror per gate with randomized benchmarking, we find an error of roughly 0.25 percent per gate. 
This corresponds to a T 2echo of over 10 /is. The amplitude of (C) vs. dephasing error per qubit is 
shown in Fig. 3(a). The violation amplitude is relatively robust to this error, and can sustain error 
rates of up to 30 percent while still exhibiting non-classical correlations. The second major error 
mechanism was reduced measurement visibility coming from T\ energy decay or spurious 11) state 
population. The effect on ( C ) vs. single qubit measurement visibility is shown in Fig 3(b). The 
correlation amplitude is more sensitive to this reduced measurement visibility and is significantly 
degraded at even 90 percent. In both cases, the presence of errors not only lowers the maximum vi- 


(a) (b) 




Figure 3. Effect of error mechanisms on BLGI correlations, (a) Amplitude of (C) vs. measurement strength 
for various single qubit dephasing error rates. The single qubit dephasing error rate can be thought of as 
roughly T 2 /( gate time) in the simplest case, but can be improved through the addition of echo pulses, (b) 
Amplitude of (C) vs. measurement strength for various single qubit measurement visibility values. The 
system shows a greater sensitivity to this error mechanism and cannot tolerate a visibility much below 90 
percent. 
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Figure 4. (above) Full pulse sequence of the BLGI algorithm, including spin echo pulses on the Bell qubits 
to reduce dephasing. The weak measurements were carried out simultaneously to further streamline the 
algorithm, (below) Frequency diagram of the 4 qubits showing their placement in frequency space the pulse 
sequence, in particular the various detunings during the adiabatic CZ gates. 

olation possible but the highest measurement strength at which a violation first occurs. As weaker 
measurement angles require finer calibration and provide noisier data, it is preferable to achieve a 
violation at the largest measurement strength possible. 

Given the sensitivity of (C) to various decoherence mechanisms it was important to reduce the 
BLGI pulse sequence time as much as possible for higher coherence. This is most notable during 
the weak measurement portion when we carry out simultaneous adiabatic CZ gates[ll] between 
both ancilla-Bell pairs. Lastly, we introduced spin echo pulses in the middle of the algorithm which 
cancel out dephasing during the pulse sequence while simply transforming the original |T + ) Bell 
state to a |\K - ). To maximize measurement fidelity, we used a wide bandwidth parametric amplifier 
[12], to ensure a high signal to noise ratio and shorter readout time. A separate measurement at 
the beginning of the pulse sequence was used to herald [13] the qubits to the ground state, but this 
was a small (~6 percent) effect. Lastly, we implemented numeric optimization of the adiabatic 
CZ gates using the ORBIT protocol [14] to fine tune parameters for the final data set. The full 
Pulse sequence and frequency placement of the qubits during the algorithm is shown in Fig. 4. 

During the numeric optimization of the pulse sequence, single qubit phases can be adjusted 
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Figure 5. Graph showing both experimental data (points) and theoretical predictions (lines) for the four 
BLGI terms vs. measurement strength cj). The data set was taken by averaging together 200 hundred traces 
in which each point was measured 3000 times for a total of 600,000 iterations per point. The error bars 
represent 10 standard deviations of the mean to demonstrate the scaling the ancilla measurement noise vs. 
measurement strength. Distinct differences in the behavior arise from experimental imperfections and their 
depedence on different qubit correlations. In particular the difference between E(/3i, /i 2 ) an d E(a\. /3 2 ) or 
E((3 1 , 02 ) highlights the dependence of entanglement collapse on the measurement strength of both ancilla 
qubits. 


slightly to increase correlations, leading to a larger violation. This is equivalent to changing the 
final rotation angle of the detectors slightly (~ 3 degrees). The nature of the BLGI makes it 
immune to such rotations as loss of correlations in one correlator is naturally made up for in 
another, additionally, the initial detector rotation b was chosen based on the maximum of the 
original CHSH measurements. Due to differences in qubit coherence this does not necessarily 
occur at 7t/4, but at a slightly smaller angle. The individual BLGI correlator terms measured 
in this experiment along with theory curves accounting for these realistic rotations are plotted 
in Fig. 5. The behavior of each individual term depends on the type of qubits being correlated. 
The term E(a 1 ,a 2 ) holds roughly constant, close to the expected value of — l/x/2- E{oc i,/ 3 2 ) 
and E(/3i, q: 2 ) start close to zero, and converge to around ±0.5. The behavior of E{Ji \, /±) best 
matches expectations. It begins at 0 for strong ancilla measurement and converges near l/y/2 at 
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perfectly weak measurement, following a slightly s-shaped curve. 


Sample fabrication and characterization 

Devices are fabricated identically to Ref. [11], and extensive calibration was documented in 
Ref. [10]. 

Device parameters 

The device parameters are listed in table I. Note that the coupling rate g is defined such that 
strength of the level splitting on resonance (swap rate) is 2 g (Ref. [15]). 
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Table I. Parameters for the device when running the BLGI algorithm. / are frequencies, rj is qubit nonlin¬ 
earity. g is coupling strength, n is resonator leakage rate. 



Qo 

Qi 

Q-2 

Qi 

Qubit frequencies and coupling strengths 


/iT* (GHz) 

5.30 

5.93 

5.39 

5.90 

77 / 27 r (GHz) 

-0.230 

-0.216 

-0.229 

-0.214 

fit (GHz) 

4.53 

5.42 

4.67 

5.55 

/res (GHZ) 

6.748 

6.626 

6.778 

6.658 

p res /27r (GHz) 

0.110 

0.128 

0.111 

0.109 

flqubit/^TI" (MHz) 

13.8 

14.1 

9qubit/27T (MHz) 


14.5 


1/K res (ns) 

675 

69 

555 

30 

Readout (RO) parameters 



RO error 

0.015 

0.004 

0.067 

0.007 

Thermal 1) pop. 

0.013 

0.007 

0.028 

0.01 

RO pulse length (ns) 

1000 

300 

1000 

300 

RO demodulation length (ns) 

1000 

300 

1000 

300 

Qubit lifetime at idling point 



T\ (/xs) 

26.3 

24.7 

39.2 

21.3 
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